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Abstract 

We consider two versions of a simple evolutionary algorithm model for protein folding at 
temperature zero: the (1 + 1)-EA on the LeadingOnes problem. In this schematic model, 
the structure of the protein, which is encoded as a bit-string of length n, is evolved to its 
native conformation through a stochastic pathway of sequential contact bindings. 

We study the asymptotic behavior of the hitting time, in the mean case scenario, under 
two different mutations: the one flip which flips a unique bit chosen uniformly at random 
in the bit-string, and the Bernoulli flip which flips each bit in the bit-string independently 
with probability c/n. For each algorithm we prove a law of large numbers, a central limit 
theorem and compare the performance of the two models. 
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1 Introduction 

Evolutionary algorithms (EAs) are adaptive heuristic search algorithms. They are based on the 
mechanisms of natural selection and are widely used in a great variety of problems, for instance 
population genetics, machine learning and optimization. The task of the EA is to search a fitness 
landscape for maximal values. A population of individuals, considered as candidate solutions to 
the given problem, is evolved under steps of mutation and steps of selection. Each individual 
receives a numerical evaluation, called its fitness score. The dynamics of the EA simulates, 
supposedly like in natural systems, the survival of the fittest among the individuals. Thus, 
individuals of maximum fitness are sought. 
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Despite their numerous heuristic successes, mathematical results describing the behaviour 
of EAs are rather sparse. Among the exceptions are R. Cerf jH], [HI, Y. Rabinovich and A. 
Wigderson [IHl, G. Rudolph [201 , C. Mazza and D. Piau [E], P. Del Moral and A. Guionnet 
|17| . J. Berard ^ and J. Berard and A. Bienveniie [H], (£]■ 

Since EAs usually exhibit complicated dynamics, complexity results are difficult to reach 
and it is a common approach to consider simplified cases. Among the simplified EAs are the so 
called (1 + l)-EAs. These are studied by H. Muhlenbein [IB], T. Back [2], G. Rudolph [20], J. 
Garnier et al. yjj, and S. Droste et al. ^3]; In this paper, we study the time of convergence 
of two versions of a specific (1 + 1)-EA, namely the (1 + 1)-EA on the LeadingOnes problem. 
One of the main motivation for studying these algorithms is that they can be used as a simple 
models for the protein-folding problem. Indeed, the (1 + l)-EAs we focus on, directly fit to the 
model of protein-structure prediction at temperature zero, proposed by biophysicists A. Bakk 
et al. [2]. 

1.1 The physical model 

Proteins typically fold to a unique native or biologically active conformation on time scales from 
10^'^s. to Is. However, if the dynamics of the folding process would follow a random search in 
the conformation space it would result in astronomical time scales. This paradox is known under 
the name of Levinthal's paradox pH]- How do proteins fold to their native state? This is one of 
the intriguing problems of biophysics. Anfinsen ^ showed that the native state is genetically as 
well as thermodynamically determined, i.e. it corresponds to the conformation in which Gibbs 
free energy of the whole system is lowest. 

There are many hypotheses concerning the transition state (TS). One of the view is that the 
(TS)-dynamics consists in a pathway which carries the polypeptide (protein) to the native state 
through a guided descent along the Gibbs free energy landscape (J. A. Schellman [22, K. A. Dill 
et al. [IQI). 

The protein-like model proposed by A. Bakk et al. can be described as follows. The polypep- 
tide chain is equipped with n contact points ci, ...,Cn that we will also call nodes. For i from 
1 to n, Ci is assigned a binary contact variable (t)i that indicates whether it is folded = 1) 
or unfolded ((/>i = 0). In consequence the conformation of the protein is entirely determined by 
the bit-string of length n, (/> = {(f)i^ (1)2, (pn) and the native state corresponds to the bit-string 
where = 1,^1 < i < n, that is (1, 1). There is a bijective mapping from the conformation 
space onto {0, 1}". 

Let io denote the smalest i £ {1, ...,n} for which Cj is unfolded, i.e. for which (pi = 0. We 
call the open part of the protein the set of contact points {Qq, Cjg+i, c„}. 

The assumption about the dynamics of the folding process is that each individual node 
is assigned an energy of —Eq Hi < io, zero otherwise. It can be implemented trhough the 
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Hamiltonian 

TL = -eo(</'i + 4>i(t>2 + ■■■ + (t>i--4n) 

This Hamiltonian can be re-writen in terms of the LeadingOnes function L, defined on the space 
of conformations {0, 1}", which counts the length of the longest prefix of ones in the bit-string: 

L{x) = max{A; > 1 : VI < « < fc, = 1} U {0}. 

Indeed, 

7^ = -eoi(0) = -eo(«o - 1) (1) 

In this model, there is no energy associated to the open part of the protein. It is also known 
under the name of the "zipper-model". In fact, a descent along the energy landscape means both 
the folding of the leftmost uncorrectly folded substructure, i.e node Cip, and the "status-quo" for 
the correctly folded substructures preceding it (on the left), i.e. nodes Cj for i < i^. It means 
that the folding events occur in a specific order: they behave like the individual locks in a zipper. 
In the bit-string framework, lowering the Gibbs free-energy is exactly increasing the size of the 
longest prefix of ones. 

The algorithm proposed by A. Bakk et al. to search the state space {0, 1}" for confor- 
mations of lowest energy, i.e. the native state, is based on the Monte Carlo Metropolis (MCM) 
method ( Binder j3| ). Let T denote the temperature of the whole system, k the usual Boltzmann 
constant, and put (3 = 1/kT. The algorithm proceeds iteratively as follows. The individual (bit- 
string) at time k, Xj^, undergoes a mutation to a new conformation X^, through a stochastic 
process that will be described later. Now, is selected to form the new individual at time 
A; -|- 1, Xk+i, with probability 

-^accept = min(l,exp(-/3A?t:)) , with 

AH = n{x'k)-n{Xk). 

Otherwise Xk+i is a repeat of the old configuration Xk- 

In this paper, we concentrate on the MCM model taken at temperature zero, that we denote 
by MCMq, as well as on a very close version of this algorithm. These algorithms are directly 
connected to the (1 -|- 1)-EA on the LeadingOnes problem. We recall that the dynamics of 
(1 -|- l)-EAs can be formalized through discrete Markov chains as follows: 

1.2 The (1 -|- 1)-EA approach to native conformation prediction 

In the protein model we wish to minimize the Hamiltonian 7i, which is equivalent, according 
to to maximize the LeadingOnes function. More generally, the goal of (1 -|- l)-EAs is to 
optimise some fitness function / : {0, 1}" — > IR. The algorithm proceeds as follows: a unique 
individual, or bit-string, is evolved under the following two-steps iterative process: 
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1. Mutation: 

As in the MCM method, at every evolutionary step, known as generation, the individual in the 
current population at time X^, undergoes a random walk to a new individual X^. 

2. Selection: 

Xk and X'^ are evaluated in terms of their fitness value. Then, the one with the highest fitness 
score is selected to form the generation at time + 1, X^^i: 

If f{X'^) > fiXk), then Xfc+i = Xl Otherwise X^+i = X^. (2) 

The notation (1 + 1)-EA accounts for the fact that we select among one parent and one child. 
Here, we focus on the mean case scenario, in which the first individual Xq is chosen uniformly at 
random in {0, 1}". The reason for this restriction is that the mean case is easier to manage, on 
a mathematical level, in the LeadingOnes framework. When the fitness function is precisely L, 
this algorithm will be denoted by (1 + l)i-EA. We notice that in the case of a fitness landscape 
with local maxima the (1 + 1)-EA method could end in a suboptimal search. But, as long as 
we consider the LeadingOnes problem whose fitness has no local maxima, we are not worried 
about that. In the litterature, there is no actual consensus, in the definition of (1 + l)-EAs, on 
the selection rule. It is sometimes taken to be the following sligthly different one: 

If f{X'k)>fiXk), then Xk+i = X'k. Otherwise X^+i = X^. (3) 

For example, Garnier et al. ^1] consider the first version of the selection rule JSl, whereas Droste 
et al. [T^ focus the second version (EJ. 

We notice that in the LeadingOnes framework, the variant of (1 + l)i-EA with mutation 
rule © is nothing else but MCMq. In order to discriminate them, we shall keep the name 
MCMq. The only difference between these two very close algorithms being that MCMq accepts 
candidates X'^ whose energy is the same as the one of X^, whereas (1 + 1)£-EA does not. 

1.3 Statement of the results 

T„ (respectively r„) denotes the hitting time until some optimal (with regard to the fitness 
function) conformation or individual is sampled by the (1 + 1)-EA (by MCMq respectively). We 
focus on both MCMq and (1 + l)i,-EA in the mean case scenario, under two different kinds of 
mutation: the one flip, which flips a unique bit chosen uniformly at random in the bit-string, 
and the Bernoulli flip, which flips each bit in the bit-string independently with probability cjn. 
As briefly recalled in the introduction, there has already been some work about the complexity 
of some (1 + l)-EAs: 

From Droste, Jansen and Wegener jl^, E{Tn) = 6(nlnn) for the Bernoulli flip applied to a 
linear fitness function on {0, 1}". From Droste et al. ^21, the LeadingOnes function, is solvable 
in mean time B(n^) in the Bernoulli flip scenario. 

Garnier et al. study the OneMax function | • | , which counts the number of ones in the 
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bit-string, in the one flip and the BernoulH flip frameworks, that is 

n 

i=l 

In the one flip case, (T„ — n In n)/n converges in distribution to — In 2 — InZ. In the Bernoulli 
flip case, (r„ — c~^e'^nlnn)/n converges in distribution to —c~^e'^ In Z + C{c), where the law of 
Z is exponential of parameter 1 and C(c) is some c-dependent constant. 

We prove the analog for the LeadingOnes problem of the result of Garnier et al. pi]. This 
improves on the result of Droste et al. ^2]. We prove a law of large numbers, a central limit 
theorem, and we compare the performance of the two models. Finally, we prove that the 
distribution of the hitting time of MCMq, Tn, is the same as the one of r„ in both the one flip 
and the Bernoulli flip scenari. 

Theorem 1.1 (one flip case) (i) For n > 1, E{Tn) = r? 12; 
(ii) As n ^ oo, Tn/E{Tn) converges in probability to 1. 

(Hi) As n ^ oo, (Tn — E{Tn)) /n^^"^ converges in distribution to a centered Gaussian random 
variable of variance 3/4. 

Theorem 1.2 (Bernoulli flip case) i) As oo, E{Tn) ~ m{c)n^, with 

m{c) := (e"-l)/(2c2). 
ii) As n ^ oo, Tn/E{Tn) converges in probability to 1. 

Hi) Furthermore, {Tn — m{c) n'^)/n^^'^ converges in distribution to a centered Gaussian random 
variable of variance o"^(c), with 

a\c) :=3(e2'=-l)/(8c3). 
Note that m(c) > 1/2 for every c > 0. 

Corollary 1.1 As n ^ oo, E{Tn) for the Bernoulli flip case is greater than E(Tn) for the one 
flip case, for any value of c. 

Theorem 1.3 In both the one flip and the bernoulli flip case, Tn and Tn have the same distri- 
bution. 

2 Proof of Theorem 11.11 

The law of T„, conditioned by \Xo\, is the law of a sum of geometric random variables, see 
Lemma f2. 11 This yields Part (i) of the theorem. Since the CLT implies the law of large numbers, 
we then prove the CLT of Part (iii). 
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The distribution of T„ stems from a simple observation. In the (1 + 1)-EA on the LeadingOnes 
problem, a mutation is accepted if and only if it adds 1 to the number of leading ones. As a 
consequence, in the one flip framework, the chain jumps when the leftmost zero is flipped. The 
other flips leave the chain unchanged. Thus, the zeroes in Xq are successively flipped, from left 
to right, until one hits the optimal individal (conformation) 

Here and later in the paper, e = 1/n when we deal with algorithms on strings of length n, 
|x| is the number of ones in x, the geometric law G{p) of parameter p is defined by 

n>l 

and the negative binomial law of parameter {kQ,p), J\fB{kQ,p), puts the following mass on A; > k^: 

Lemma 2.1 If \Xq\ = n — k^, Tn is the sum of k^ i.i.d. Q{e) random variables. Thus, the law 
ofTn is negative binomial of parameter {ko,e). 

Proof of Lemma 12. IL Let tq = and, for every A; > 0, 

Tfc+l = inf{i >Tk;Xi^ Xr^}, CTfe+l = Tk+l - Tk, Xk = Xr^ . (4) 

In words, denotes the position of the chain after its k-th. jump, and \Xi^\ = n — ko + k. The 
leftmost zero of X^ is flipped after a sojourn at Xf^ of length (Jk+i- Thus, (ct/j)^ is i.i.d. of law 
G{e). It remains to note that 

r„ = fJi H h CTfcQ. 

□ 

Proof of Part (iii). Let Ek denote the conditioning on {\Xq\ = n — k}. Since Xq is uniform, 
the law n of |Xo| is binomial (n, 1/2). From Lemma ITT| under Pfc, is the sum of k i.i.d. 
geometric random variables of parameter e. Thus, 

^fc(e-"^") =e-"'=e^ [l - (1 - e) e""] "\ (5) 

Set @n = (Tn — V? jT) jr?!'^ . The decomposition of the Laplace transform of G„ along the values 
of l^ol, the explicit form of /x, and Equation (EJ yield together that 

£;(e-°0") = eV^"/2 2-" e-"*=/"'^' e"" 

fc=o ^ ' 

This can be rewritten as 



1 - (1 - e) e-°/^ 
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with 

/3„ = ee-"/"'^' [l - (1 - £) e""/"'^'] ~^ . 
Recall that e = 1/n. The expansion of /?„ reads 

/3„ = 1 — a/\/n + a^/n + o(l/n). 

This implies that 

as n — > oo. This concludes the proof. □ 

3 Proof of Theorem O 

We first describe the law of T„ conditionally on the values taken by L along the path of (Xfc)fc 
before T^, that is, until the optimal individual (1, 1, ... , 1) is hit. This law is the law of a sum 
of independent geometric random variables, see Lemma We deduce the overall law of T^, 
see Proposition 13.11 This yields part (i) of the theorem. 

As in the previous section, since CLT implies the law of large numbers, we then prove the 
CLT of part (iii). 

We recall that the (1 + 1) EA, in the LeadingOnes framework, accepts a mutation if and only 
if the number of leading ones is increased. Hence the dynamics of the Bernoulli flip algorithm 
proceeds as follows: the chain jumps to a new individual, at time k + 1, if and only if the leading 
ones of Xfc are left unchanged and its leftmost zero is flipped, no matter which values are taken 
by the other bits. 

Here and later in the paper, e = c/n when we deal with algorithms on strings of length n. 
For all i > 0, let p{n,i) = e(l — e)*. As in the one flip framework, Xf^ denotes the position of 
the chain after its k-th jump. We also keep the same definition for ak and r^. 
For all > 0, let 

4 = L{Xk). 



Let Yq be such that 



Xo = il'\0,Yo). 



For k > 1, define Yk and Wk by 

Xk = {l'''-\l,Wk) = {l'\0,Yk). 

Lemmas IH.ll and l,S.2l below are needed to compute the law of of T„ conditionally on {£j) in 
Lemma 



Lemma 3.1 (i) For all k >0, ak depends on the past only through the last score ik~i- That is, 
the law of ak, conditionally on (Xt) ^t^-^-^y , is the law of ak, conditionally on £k-i- 
(a) For all k >0, given {£k-i = the law of ak is Q{p{n,i)). 
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Proof. The sojourn time is the time the algorithm takes to jump from Xf^_i to X^. The 
leftmost zero of is in position i^-i + 1- Thus, one needs to flip the {ik-i + l)-th bit, while 
leaving the first ik-i bits unchanged. Thus, 

(t-i) 



P{ak = t\Xo,...,Xk_i)=e{l-eY''-' 



1-8 1-8)"=- 



□ 



Lemma 3.2 For all k > 1 let = cr{ai,£j : i < k,j < k — 1}, then the law of ik conditionally 
on J-k is the law of £k conditionally on £k~i- 

Proof. Let Pi denote the probability given {^o = i}- Using the strong Markov property: 

-P(4 = ikWk = tk,ik-i = ik-i, ■ ■ ■ ,(^1 = iiJo = io) = P {h = «fcki = tk,io = ik-i) (6) 

Since = ii,ai =t,£o = «o} = {L{Xt) = ii,L{Xt-i) = ... = L{Xo) = io}, since {ai =t,£o = 
io} = {L{Xt) / io, L{Xt-i) = . . . = L{Xq) = io} and using the Markov property on {L{Xt)t) 
we derive the following expression: 

P[h = ik\(^i = tkJo = ik-i) - 



P {L{Xi) ^ ik-i\L{Xo) = ik-i) 
This quantity is indepedent from tfc, hence if we reconsider 

P (4 = ik\(yk = 4,4-1 = ifc-i, • • • ,0"! = = io) = P{ik = ik\h-i = ik-i) 

□ 

Lemma 3.3 Conditionally on {io = zq, • • • ,4-i = ij-iAj = Tn is the sum of J indepen- 
dent geometric random variables with respective parameters p{n,io), ■ ■ ■ ,p(n,ij_i). 

Proof. Given the successive LeadingOnes scores {io = io, ■ ■ ■ , = n} until the optimal indi- 
vidual is hit, Tn = J2k=i ^k- Thus, 

P{Tn = t\lj = n,...,io = io)= P{aj = tj,...,ai=ti\ij = n,...,£o = io) (7) 

Using Lemmas ( l,S.2p and ( LS.ip we can derive by induction: 

J 



P{t,j = n,aj = tj,...,ai=ti,io = io) = W P{^k = ifc|4-i = ik-i)P{(^k = 4|4-i = ik-i] 

k=l 

Hence, since 13^=1 -P(4 = «fc|4-i = «fe-i) = P{^J = n,...Jo = io), 

J 

P{ai = ti, . . . ,aj = tj\ij = n,...,io = io) = Y[^('^k = 4|4-i = ik-i) 



k=l 



8 



We can now replace this last equation in 0, 

J 

P{Tn = t\£j = n,...,io = io) = YlP{(^k = tk\h-i = ik-i) 

ti+-+tk=t k=i 

Let q{n,ik) denote the probabilty distribution of Q{p{n,ik))- 

Then using Lemma we can write, as we recognize a product of convolution: 

P{Tn = t\ij = n, . . . ,io = io) = q{n,ij-i) *■■■* q{n,io){t) 

Thus, given that the search jumps J times until the target (1, 1, ... , 1) is hit and given 

{^0 = io, ■ ■ ■ 1 = IT'} , Tn follows the same distribution as the sum of J independent G(j){n, io)), 

■ • • ; Oipin, ij-i)) random variables. 

Let us focus on P{io = io, . . . = ij-.i,ij = n). In order to compute this quantity, we 

need the following Lemma: 



Lemma 3.4 Let k>l. If Xo is chosen uniformly in {0, 1}", then, given {ik-i = i}, follows 
the uniform distribution on {0, 

jriWk\ik-i = i) =uaoAr-'-'). 



Proof. Let us focus on the case where k = 1. Let Pi denote the probability conditionally on 
{^0 = i}- Let fi denote the probabilty distribution of Yq given {£o = i}- As Xo is chosen 
uniformly in {0, 1}", fi is the uniform distribution on {0, 1}"^* ^ . 

Pi{W^ = w) = Y,P^iXl = w),ai=t) (8) 

t>i 

Since {X^, = (r , l,w),ai = t, L{Xo) = i} = {Xt = (V, 1, w),L{Xt^i) = ... = L{Xo) = i} and 
using the Markov property, Equation ((HJ can be rewritten: 

Pi{Wi = w) = = w))Pi{L{Xi) = L{Xo)Y~^ 

t>i 

Hence, 

^^^^^ - - P.(L(X,) > L{Xo)) 

We recall that the Markov chain jumps from Xo to a conformation of higher fitness at time 1 
if both none of the io first ones of Xo are flipped and the leftmost zero of Xq, is flipped. Hence, 

Pi{L{Xi) > L{Xo)) = e{l - ey (10) 
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On the other hand, as Pi{Yo = u) = ^{u) = l/2"-*-\ 

P,(Xi = (r , 1, w)) = l/2"-'-i ^(^1 = {l\l,w)\X^ = (1\ 0, u)) (11) 

If d{w,u) denotes the Hamming distance between w and u, 

P{Xi = {l\l,w)\Xo = (r,0,u)) =e(l-e)V'^("''")(l-e)"-^-^-''("''") (12) 

Finally Equations Cnil, CUl and (O together with 

«e{o,i}"-'-i 

yield that: 

P,(l^i = w) = XjT-'-^ (13) 
Now, using the strong Markov property we can derive the proof for any k>2. □ 

Lemma 3.5 // Xq is chosen uniformly at random in the state space, for all k > 1 , the condi- 
tional distribution of £k, given {ik-i = ik-i}, satisfies: 

P (ik = jk\ik~i = jk-i) = 2-(j'=-^''=-i) if io + l<jk<n 

= 2-("-^'^--i) tf jk=n 



Proof. This is a direct consequence of Lemma mi □ 

Now that we know the probability distribution of the sequence of the successive LeadingOnes 
scores until the target individual (1, 1, ... , 1) is hit as well as the distribution of conditional 
to the values taken by these LeadingOnes scores, we can compute the probability distribution 

ofTn-. 

Proposition 3.1 If Xq is chosen uniformly at random in {0,1}", then, the probability distri- 
bution of Tn satisfies: 

P{Tn = t) = ^Y. E q{n,ij.i)*---*q{n,iom (14) 

J {o<io<ii<...<ij=n} 

Proof. Xq being chosen uniformly at random in the search space, P{io = io) = ^ip^+i ■ Thus, 
applying Lemma l,S.5| 

p(eo = io,...,ej = n) = ^ 

The result is then a direct consequence of Lemma KS.HI □ 



10 



Proof of part (iii). Set 9^ = (r„ - '^^^^)\n^''^ . 
Thus, 

^(exp(-ae„)) = exp (a^(e^ - 1)^ E |^exp(-a-^)^ (15) 
According to the distribution of T„ given by (|THl : 

^ ^ J io<-<ij-i 

Since the variables {Q{p{n,iiS))i^ are independent, 

J-i 



^ ^ J io<---<«j-i fc=0 

^ n— 1 



A;=0 



where denotes the Laplace transform of Q{p{n,ik)). 

e~°'p{n,ik) 



l-(l-p(n,zfc))e-" 
Recalling that p{n,ik) = e{l — e)* and e = c/n, we derive that: 

Thus, replacing this in (|TK|l . as n goes to oo: 

'3q2 (e^^- 1) 



£^(exp(— Q0„)) ~ exp 



8c 



,3 



We recognize the Laplace transform of a centered gaussian variable of variance ^ — '-. It ends 
the proof. 



4 Proof of Theorem 11.31 

In the MCMo framework, the algorithm has the possibility to visit several distinct protein 
conformations (individuals) whose fitness has the same value before it finally jumps to a new 
individual with higher fitness score. This is not allowed in the (1 + 1)l-EA where the individual 
at time k, X^, is not allowed to jump to an individual X^^i with the same fitness score and 
that would not be Xf^ itself. 

As in the previous sections we denote by {Xk)k the chain defined by the protein conforma- 
tions taken at the times of fitness jumps (Tfc)^. 
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Now, we briefly sketch the proof of Theorem II .HI 
Proof in the one flip case. 

Here, we put e = l/n. 

In the (1 + 1) framework, once Xq has been sampled, the path {Xk)k becomes entirely deter- 
ministic: going trough it means exactly flipping, one at a time, from left to right the zeros of 
Xq. This is not true in the case of MCMq and we cannot adapt directly the proof of Theorem 

o 

The idea of the proof is close to the one Theorem 11.21 First we consider the law of T„ 
conditionally on the values taken by L along the path (Xfc)^ until the ground state is sampled. 
The same basics arguments apply. Lemma [3^ remains true and we find that, as in the one flip 
(1 + 1) framework, {a)k is i.i.d. of law ^(e). Now, Lemma rOl can easily be adapted: 

Lemma 4.1 Conditionally on {^o = ^0;---5^J-i = ij-i^^J = it-}, is the sum of J i.i.d. 
geometric random variables with parameter e, i. e. Tn is negative binomial of parameter ( J, e) . 

Lemma lOl still holds: 
Proof. The proof is a copy the one of Lemma ITU up to Equation 0: 

pm/ ^ Pi{x^ = ii\i,w)) 

^^(^^ = = P.(L(X,) > L(Xo)) 
In the one flip scenario, a unique bit at a time is flipped during the step of mutation. In 
consequence, in order to sample (l*,l,tt;) at time 1, we need to have sampled {V,0,w) at time 
0. Thus, 

P,{Xi = il\ l,w)) = P(Xi = {l\l,w)\Xo = {l\0, w))Pi{Yo = w) = l/(n2"-^-i) (16) 
On an another hand, 

Pi{L{Xi) > L{Xo)) = l/n (17) 

Equalities lfT6|l and (|T7|l applied to Equation ^ return the result for A; = 1. Finally, the strong 
Markov property ends the proof for A: > 1. 

Now, we can prove Theorem II. I-il in the one flip case: 
Proof of Theorem 11.31 . From the above, we derive the distribution of T^: 



P{fn = t) = E AAe(J,e)(t) 

J {o<io<ii<...<ij=n} 



J 

We recall from Lemma [2.11 that conditionally on \Xq\ = n — J, r„ is negative binomial of 
parameter (J, e). As Xq is chosen uniformly at random in {0, 1}", it yields that for all t > 0: 

P{Tn = t)= P{fn = t) 
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□ 



Proof in the Bernoulli flip case. 

The dynamics of MCMq and (1 + 1)^, slighlty differ. Though, we notice that in the MCMq 
framework, Lemmas 13.11 13.21 13.41 13.51 still hold. Now, since the proof of Proposition 13. 1| is 
entirely based on these lemmas, we derive that the probabilty distribution of the hitting time is 
the same in both the MCMq and the (1 + 1)l-EA scenari. 

□ 

5 Conclusion 

After examining the two versions (one flip and Bernoulli flip) of the EAs on which we focused, 
we reach the following conclusion: As (e^ — l)/c^ > 1 for all c S IR+, the expected value of the 
hitting time is higher in the Bernoulli flip than in the one flip. Thus, we can conclude that the 
one flip performs better than any Bernoulli flip, in terms of the expected hitting time. The same 
conclusion has already been derived by Garnier et al. [Tl] for the OneMax problem. 

This better performance of the one flip suggests that, despite the ability of the Bernoulli 
flip to jump from any region of the search space to any other, in a single iteration of the 
search process, the Bernoulli flip results in a slower convergence to a given individual, in the 
LeadingOnes framework. 

In order to explain this phenomenon, as the Markov chain which models our (1 + 1) search 
process accepts a mutation only in case of an increase in the number of leading ones, we should 
point out the following facts: in the Bernoulli flip framework, the closer the algorithm gets to 
the target individual, the longer the algorithm waits until it jumps; on the other hand, in the 
one flip case, the number of leading ones currently present in the bit-string does not interfere 
in the distribution of the time taken for the search to jump. Also, this probability distribution 
remains stable as the search draws near to the optimal individual. 
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